Introducing ROC Curves as Error Measure Functions: A New Approach to Train ANN-Based Biomedical Data Classifiers

نویسندگان

  • Raúl Ramos-Pollán
  • Miguel Ángel Guevara-López
  • Eugénio C. Oliveira
چکیده

This paper explores the usage of the area (Az) under the Receiver Operating Characteristic (ROC) curve as error measure to guide the training process to build machine learning ANN-based classifiers for biomedical data analysis. Error measures (like root mean square error, RMS) are used to guide training algorithms measuring how far solutions are from the ideal classification, whereas it is well known that optimal classification rates do not necessarily yield to optimal Az’s. Our hypothesis is that Az error measures can guide existing training algorithms to obtain better Az’s than other error measures. This was tested after training 280 different configurations of ANNbased classifiers, with simulated annealing, using five biomedical binary datasets from the UCI machine learning repository with different test/train data splits. Each ANN configuration was trained both using the Az and RMS based error measures. In average Az was improved in 7.98% in testing data (9.32% for training data) when using 70% of the datasets elements for training. Further analysis reveals interesting patterns (Az improvement is greater when Az are lower). These results encourage us to further explore the usage of Az based error measures in training methods for classifiers in a more generalized manner.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Measure Oriented Training Scheme for Imbalanced Classification Problems

Since the overall prediction error of a classifier on imbalanced problems can be potentially misleading and biased, it is commonly evaluated by measures such as G-mean and ROC (Receiver Operating Characteristic) curves. However, for many classifiers, the learning process is still largely driven by error based objective functions. As a result, there is clearly a gap between the measure according...

متن کامل

Overlaying classifiers: a practical approach for optimal ranking

ROC curves are one of the most widely used displays to evaluate performance of scoring functions. In the paper, we propose a statistical method for directly optimizing the ROC curve. The target is known to be the regression function up to an increasing transformation and this boils down to recovering the level sets of the latter. We propose to use classifiers obtained by empirical risk minimiza...

متن کامل

Receiver operating characteristic ( ROC ) and other curves measuring discriminability of classifiers ’ ensemble for asthma

Purpose: The aim was studying the discriminability by ROC curves and gain charts for simple fixed combining of constituent classifiers, for asthma severity diagnosis, and also for bagging and boosting. Material and methods: ROC shows a performance over a range of relative costs and probabilities a priori. Area under ROC curve (AUC) is the measure of separability of two probability distributions...

متن کامل

Development and comparison of automated classifiers for glaucoma diagnosis using Stratus optical coherence tomography.

PURPOSE To develop and compare the ability of several automated classifiers to differentiate between normal and glaucomatous eyes based on the quantitative assessment of summary data reports from Stratus optical coherence tomography (OCT; Carl Zeiss Meditec Inc., Dublin, CA) in a Chinese population in Taiwan. METHODS One randomly selected eye from each of 89 patients with glaucoma and each of...

متن کامل

Introducing a New Model for Individual Cognitive Factors Influencing Human Error Based on DEMATEL Approach

Background and Objectives: The recognition of a system failure causes and their related factors are considered as the most important factor in preventing accident occurrence in different organizations including industries. Human error is a known important factor in unpredictable events of which cognitive factors are the most influential ones. The purpose of this study was to introduce a new mod...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010